Abstract

We compute the pressure of the random energy model (REM) and generalized random
energy model (GREM) by establishing variational upper and lower bounds.
For the upper bound, we generalize Guerra's "broken replica symmetry bounds",
and identify the random probability cascade as the appropriate random overlap
structure for the model. For the REM, the lower bound is obtained in the
high-temperature regime using Talagrand's concentration of measure inequality, and in
the low-temperature regime using convexity and the high-temperature formula. The
lower bound for the GREM follows from the lower bound for the REM by induction.
While the argument for the lower bound is fairly standard, our proof of the upper
bound is new.

1 Introduction



While the analysis and properties of finite-range spin glass systems (like the
Edwards-Anderson model) are still much debated, even in the physics community, in recent
years there has been substantial progress in the mathematical understanding of mean-field
models [22]. This advance was triggered by the introduction of a "quadratic interpolation"
technique, pioneered in [?] to establish the existence of the thermodynamic limit for
the Sherrington-Kirkpatrick (SK) model [21] and further developed by Guerra in [18] to
prove an upper bound for the pressure which coincides with the Parisi replica symmetry
breaking solution of the model. Motivated by the cavity picture, this bound was generalized
to a variational bound by the introduction of Random Overlap Structures (ROSt)
and associated random weights in [1]. Guerra's bound is recovered when the weights
are chosen from the inhomogeneous Poisson point process studied by Ruelle [?] and the
ROSt is a hierarchical one. Recently Talagrand [30] was able to prove that Guerra's
bound is optimal, by showing that the correction term in the bound goes to zero in the
thermodynamic limit, thus establishing the rigorous validity of the Parisi solution.

In this paper we solve the Generalized Random Energy Model (GREM) [?, ?] by a
simple analysis that partially follows the one developed for the SK model. We first obtain
a variational upper bound for the pressure by defining the appropriate auxiliary
system (i.e., ROSt) for the model. As a guideline in this step we use the basic covariance
inequality that was identified in [9]. To show that the upper bound is optimal when the
ROSt random weights are chosen according to the Poisson-Dirichlet point process we use
a different strategy from the one developed for the SK model. Indeed, the corresponding
lower bound is easily obtained from a complete control of the high-temperature region
and convexity of the pressure. To study the high-temperature region we propose a new
induction argument which, starting from infinite temperature, covers all the temperature
values up to the critical one.

Despite the recent progress, there is not yet a direct proof of the most prominent
property of the Parisi solution, namely ultrametricity. A distance $d$ in a metric space is said
to be ultrametric if the standard triangle inequality is replaced by the much stronger
inequality $d(x,z) \le \max\{d(x,y), d(y,z)\}$. Equivalently, in an ultrametric space all
triangles are equilateral or isosceles with longer equal sides. The Generalized Random Energy
Model (GREM), introduced by Derrida [?] as a model that possesses replica symmetry
breaking, is ultrametric by construction, since its Hamiltonian is defined as the sum of
independent Gaussian random variables positioned on the branches of a hierarchical tree.
As a first step in the direction of achieving a proof of ultrametricity for mean-field models,
generalized non-hierarchical models have been considered in [?], where it has been shown
that they exhibit GREM-like behaviour. The analysis of the present paper, which sets
the GREM into the general variational scheme developed for the SK model, could
be helpful in further studying non-hierarchical models.

The paper is organized as follows. In Section 2 we treat the basic case of the Random
Energy Model (REM), where the energy levels are independent Gaussian random variables.
This is a necessary warm-up since the GREM will be treated as a nested
succession of REM-like systems. The full analysis is presented in Section 3. In the
Appendix we recall some useful concentration of measure estimates.

Acknowledgements: We thank A. Bovier and A. C. D. van Enter for helpful comments
and suggestions to improve the presentation of the results. We also thank A. Bianchi
for early discussions, and an anonymous referee for numerous helpful improvements. C.G.
acknowledges P. Contucci and S. Graffi for their encouragement in this work. His research
is conducted under financial support of NWO project 613000435.



2 REM 

The random energy model (REM) is a statistical mechanical model, where the energy levels
are independent and identically distributed Gaussian random variables. More precisely,
for a system of size $N$, the Hamiltonian is a Gaussian centered family with covariance
matrix

$$C_H(\sigma,\sigma') := E[H_N(\sigma)H_N(\sigma')] = \frac{N}{2}\,\delta(\sigma,\sigma') \tag{1}$$

where $\sigma, \sigma' \in \{+1,-1\}^N$ are vectors whose components are Ising spin variables. The
dependence on $N$ in (1) is such that thermodynamic observables (energy, free energy, etc.)
are extensive in the volume, while the factor 1/2 is included as a matter of convention. In
this paper we denote by $X$ a standard Gaussian, $E[X] = 0$, $E[X^2] = 1$; thus an explicit
representation of the Hamiltonian is

$$H_N(\sigma) = \sqrt{\frac{N}{2}}\; X(\sigma) \tag{2}$$

where $\{X(\sigma)\}_{\sigma\in\Sigma_N}$ are $2^N$ i.i.d. copies of the random variable $X$. An equivalent
representation, more in the spirit of statistical mechanics, is to consider a "lattice" $\Lambda$ with
$N = |\Lambda|$ sites and a random Hamiltonian



$$H_N(\sigma) = \sqrt{\frac{N}{2^{N+1}}} \sum_{X\subseteq\Lambda} J_X\,\sigma_X \tag{3}$$

defined on the spin configurations $\sigma : \Lambda \to \{+1,-1\}$. In (3) the $J_X$'s are a family of
i.i.d. Gaussian random variables, with $E[J_X^2] = 1$, and $\sigma_X := \prod_{i\in X}\sigma_i$ for each $X \subseteq \Lambda$.
Henceforth, all Gaussian random variables will be understood to have expectation equal
to 0.

The main quantity we are going to study is the quenched pressure. We denote by
$\Sigma_N = \{-1,+1\}^N$ the space of all possible spin configurations. For a finite system we
define the random partition function

$$Z_N(\beta) = \sum_{\sigma\in\Sigma_N} e^{-\beta H_N(\sigma)} \tag{4}$$

and the quenched pressure

$$P_N(\beta) = \frac{1}{N}\, E[\ln Z_N(\beta)] \tag{5}$$

We will be interested in the thermodynamic limit

$$P(\beta) = \lim_{N\to\infty} P_N(\beta) \tag{6}$$

In the following we will sometimes drop the $N$-dependence in the Hamiltonian $H(\sigma)$ in
order to lighten the notation. We will also introduce additional randomness by considering
an auxiliary system which is coupled to the Hamiltonian. We will denote by $E[\cdot]$ the
expectation with respect to all random variables involved.
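As a numerical illustration of definitions (4) and (5), the following minimal sketch estimates $P_N(\beta)$ by direct sampling of the $2^N$ independent energies of Eq. (2). It is only feasible for small $N$; the function names, the sample sizes and the seed are illustrative assumptions and are not part of the paper.

```python
import math
import random


def sample_log_partition(n_spins, beta, rng):
    """Return ln Z_N(beta) for one draw of the 2^N i.i.d. energies H = sqrt(N/2) X."""
    scale = math.sqrt(n_spins / 2.0)
    # Exponents -beta * H(sigma), one per spin configuration.
    exponents = [-beta * scale * rng.gauss(0.0, 1.0) for _ in range(2 ** n_spins)]
    top = max(exponents)  # log-sum-exp shift for numerical stability
    return top + math.log(sum(math.exp(e - top) for e in exponents))


def estimate_pressure(n_spins, beta, n_samples=20, seed=0):
    """Monte Carlo estimate of P_N(beta) = E[ln Z_N(beta)] / N."""
    rng = random.Random(seed)
    total = sum(sample_log_partition(n_spins, beta, rng) for _ in range(n_samples))
    return total / (n_samples * n_spins)


if __name__ == "__main__":
    for beta in (0.0, 0.5, 1.0):
        annealed = math.log(2.0) + beta ** 2 / 4.0  # (1/N) ln E[Z_N], an upper bound by Jensen
        print(beta, estimate_pressure(10, beta), annealed)
```

At $\beta = 0$ the estimate equals $\ln 2$ exactly, since $Z_N(0) = 2^N$; for small $\beta > 0$ it should sit slightly below the annealed value $\ln 2 + \beta^2/4$.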



2.1 Upper bound 

We start by recalling the quadratic interpolation technique. 

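Lemma 2.1 Let $H(\sigma)$ be a Gaussian family, indexed by $\sigma \in \Sigma_N$, with covariance $C_H(\sigma,\sigma')$.
Let $\alpha \in \mathcal{A}$ be an index ranging over the set $\mathcal{A}$, and let $K(\alpha)$ and $V(\sigma,\alpha)$ be Gaussian
random variables, independent of $H(\sigma)$ and of each other, with covariances $C_K(\alpha,\alpha')$ and
$C_V(\sigma,\alpha;\sigma',\alpha')$, respectively. Suppose that

$$C_H(\sigma,\sigma') + C_K(\alpha,\alpha') \ge C_V(\sigma,\alpha;\sigma',\alpha') \tag{7}$$

for all $\sigma,\sigma' \in \Sigma_N$ and $\alpha,\alpha' \in \mathcal{A}$, and suppose that

$$C_H(\sigma,\sigma) + C_K(\alpha,\alpha) = C_V(\sigma,\alpha;\sigma,\alpha) \tag{8}$$

for every $\sigma \in \Sigma_N$ and $\alpha \in \mathcal{A}$. Moreover, suppose that there is a random weight
$w : \mathcal{A} \to [0,\infty)$ such that, almost surely, $\sum_{\alpha\in\mathcal{A}} w(\alpha)$ is strictly positive and finite. Then,

$$P_N(\beta) \le \frac{1}{N} E\left[\ln \frac{\sum_{\sigma,\alpha} w(\alpha)\, e^{-\beta V(\sigma,\alpha)}}{\sum_{\alpha} w(\alpha)\, e^{-\beta K(\alpha)}}\right]
= \frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha)\, e^{-\beta V(\sigma,\alpha)}\Big] - \frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha)\, e^{-\beta K(\alpha)}\Big] \tag{9}$$

as long as the right-hand side is well-defined (i.e., not $\infty - \infty$).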

Proof: We refer to [?] for full details. Here we only recall the basic idea. For
$t \in [0,1]$ define an interpolating Hamiltonian

$$H(\sigma,\alpha;t) = \sqrt{1-t}\,[H(\sigma) + K(\alpha)] + \sqrt{t}\; V(\sigma,\alpha) \tag{10}$$

and an associated random partition function

$$Z_{N,t}(\beta) = \sum_{\sigma,\alpha} w(\alpha)\, e^{-\beta H(\sigma,\alpha;t)} \tag{11}$$

Let $\Omega_{N,\beta,t}$ denote expectation with respect to the multiple-replica product measure, where
the weight for a configuration $(\sigma,\alpha)$ of a generic copy is given by the Gibbs measure associated
to $H(\sigma,\alpha;t)$ times a generic weight $w(\alpha)$. In particular, for a function $f(\sigma,\alpha;\sigma',\alpha')$ of two
replicas, one has

$$\Omega_{N,\beta,t}\{f(\sigma,\alpha;\sigma',\alpha')\} = \sum_{\sigma,\alpha}\sum_{\sigma',\alpha'} w(\alpha)\,w(\alpha')\, \frac{e^{-\beta[H(\sigma,\alpha;t)+H(\sigma',\alpha';t)]}}{Z_{N,t}^2(\beta)}\; f(\sigma,\alpha;\sigma',\alpha') .$$

Then one actually has (because of the equality along the diagonal, Eq. (8))

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha)\, e^{-\beta V(\sigma,\alpha)}\Big] - \frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha)\, e^{-\beta K(\alpha)}\Big] - P_N(\beta)
= \frac{\beta^2}{2N} \int_0^1 E\big[\Omega_{N,\beta,t}\{C_H(\sigma,\sigma') + C_K(\alpha,\alpha') - C_V(\sigma,\alpha;\sigma',\alpha')\}\big]\, dt \tag{12}$$

This is proved by differentiating the quantity

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha)\, e^{-\beta H(\sigma,\alpha;t)}\Big]$$

with respect to $t$ and using the generalized Wick's rule. Because of (7), the right-hand
side of (12) is nonnegative and equation (9) follows. □

Remark 1 The same basic argument works to bound $E[F(Z_N(\beta))]$ for other functions
such as $F(z) = z^a$.

Remark 2 An identity such as (12) is usually called a sum-rule. The process $K(\alpha)$ has
to be thought of as a large reservoir which acts on the original system $H(\sigma)$ through the
interaction $V(\sigma,\alpha)$.

We are going to use the previous lemma to establish an optimal upper bound for the 
REM model. A key element is to choose the correct formula for the random weight w(a). 
The correct formula for mean field spin glasses seems to generally be given by Ruelle's 
random probability cascade. For the REM, it is given by a single level of that cascade, which is
sometimes called the Poisson-Dirichlet process¹. Let us give a brief description of this
(the unnormalized version) in order to facilitate the following proposition.

Given $0 < m < 1$ consider the Poisson point process on $(0,\infty)$ with intensity measure
equal to $m\,w^{-m-1}\,dw$. Almost surely, the points can be labelled $\{w_1, w_2, \dots\}$ with
$w_1 > w_2 > \dots > 0$. Moreover, $\sum_{\alpha=1}^{\infty} w_\alpha$ is strictly positive and finite, almost surely.
The distribution of $\{w_\alpha\}_\alpha$ has a remarkable invariance property: if $(f_1, f_2, \dots)$ are i.i.d.
copies of the random variable $f$, which are assumed to be independent of $\{w_\alpha\}_\alpha$, then
(modulo permutations) the distribution of $\{e^{f_\alpha} w_\alpha\}_\alpha$ is the same as $\{c\,w_\alpha\}_\alpha$ where $c$ is the
nonrandom number $c = (E[e^{mf}])^{1/m}$. This is easily proved using the generalized Laplace
transform. (For a proof see, for example, reference [?], page 481.)

¹ See [?], but also see [29]. For a rather more abstract version, see [?].

One may note that instead of considering the Poisson point process $\{w_\alpha\}_\alpha$ with intensity
equal to $m\,w^{-m-1}\,dw$, one could instead consider $w_\alpha = e^{y_\alpha/m}$ for some $\{y_\alpha\}_\alpha$. Then
$-\infty < \dots < y_2 < y_1 < \infty$ is a Poisson point process on $\mathbb{R}$ with intensity measure $e^{-y}\,dy$
(independent of $m$). One thinks of the $y_\alpha$'s as the free energies. In this notation $m$ is
explicit. Henceforth $\{y_\alpha\}_\alpha$ will refer to the point process just described.
A consequence of the invariance property mentioned above is

$$\sum_{\alpha=1}^{\infty} e^{y_\alpha/m} \exp(f_\alpha) \;\overset{\mathcal{D}}{=}\; E[\exp(mf)]^{1/m} \sum_{\alpha=1}^{\infty} e^{y_\alpha/m} , \tag{13}$$

which will be useful.
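The identity (13) can be probed numerically. The sketch below is a rough Monte Carlo check under stated assumptions: the point process is truncated to its `n_points` largest points, $f$ is taken Gaussian with standard deviation `sigma`, and all function names are hypothetical. Taking logarithms and expectations in (13) predicts $E[\ln\sum_\alpha w_\alpha e^{f_\alpha}] - E[\ln\sum_\alpha w_\alpha] = \frac{1}{m}\ln E[e^{mf}] = m\,\sigma^2/2$.

```python
import math
import random


def ppp_weights(m, n_points, rng):
    """Largest n_points weights w = exp(y/m) of the PPP with intensity exp(-y) dy.

    If G_1 < G_2 < ... are the arrival times of a rate-1 Poisson process,
    then y_k = -ln(G_k) has intensity exp(-y) dy, hence w_k = G_k ** (-1/m).
    """
    weights, arrival = [], 0.0
    for _ in range(n_points):
        arrival += rng.expovariate(1.0)
        weights.append(arrival ** (-1.0 / m))
    return weights


def invariance_gap(m, sigma, n_points=100, n_samples=2000, seed=1):
    """Estimate E[ln sum w e^f] - E[ln sum w] for f ~ N(0, sigma^2), truncated PPP."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        w = ppp_weights(m, n_points, rng)
        tilted = sum(wk * math.exp(rng.gauss(0.0, sigma)) for wk in w)
        total += math.log(tilted) - math.log(sum(w))
    return total / n_samples


if __name__ == "__main__":
    m, sigma = 0.5, 1.0
    print("Monte Carlo:", invariance_gap(m, sigma))
    print("predicted  :", m * sigma ** 2 / 2.0)
```

The truncation is harmless here because, for $m < 1$, the sum is dominated by its first few terms; the agreement is statistical, not exact.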

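Proposition 2.2 Let $C_H(\sigma,\sigma')$ be given by Eq. (1). Choose $w(\alpha) = \exp[y(\alpha)/m]$ for
$0 < m < 1$. For each $b > 1$ let

$$C_K(\alpha,\alpha') = (b-1)\,\frac{N}{2}\,\delta(\alpha,\alpha') \tag{14}$$

$$C_V(\sigma,\alpha;\sigma',\alpha') = b\,\frac{N}{2}\,\delta(\sigma,\sigma')\,\delta(\alpha,\alpha'). \tag{15}$$

Then one obtains the optimal upper bound for the REM,

$$P_N(\beta) \le \inf_{0<m<1} \Big[\frac{1}{4}\, m\beta^2 + \frac{1}{m}\ln 2\Big] \tag{16}$$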

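Proof: We note that Lemma 2.1 is applicable because

$$b\,\delta(\sigma,\sigma')\,\delta(\alpha,\alpha') \le \delta(\sigma,\sigma') + (b-1)\,\delta(\alpha,\alpha') , \tag{17}$$

with equality when $\sigma = \sigma'$ and $\alpha = \alpha'$.
We compute separately the two terms in Eq. (9). For the first one, due to Eq. (15), we
have

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha) \exp[-\beta V(\sigma,\alpha)]\Big]$$
$$= \frac{1}{N} E\Big[\ln \sum_{\alpha} \exp\Big[\frac{y(\alpha)}{m}\Big] \sum_{\sigma} \exp\Big[-\beta\sqrt{\frac{bN}{2}}\, X(\sigma,\alpha)\Big]\Big] \tag{19}$$
$$= \frac{1}{N} E\Big[\ln \sum_{\alpha} \exp\Big[\frac{y(\alpha)}{m}\Big] Z_N(\beta\sqrt{b};\alpha)\Big] \tag{20}$$

where $Z_N(\beta\sqrt{b};\alpha)$ are independent copies (labeled by the $\alpha$'s) of the random variable
$Z_N(\beta\sqrt{b}) = \sum_{\sigma} \exp\big[-\beta\sqrt{bN/2}\; X(\sigma)\big]$. By applying the invariance property of Eq. (13) to
Eq. (20) with $\exp[f_\alpha] = Z_N(\beta\sqrt{b};\alpha)$ we obtain

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha) \exp[-\beta V(\sigma,\alpha)]\Big]
= \frac{1}{N} E\Big[\ln \sum_{\alpha} \exp\Big[\frac{y(\alpha)}{m}\Big]\Big] + \frac{1}{mN} \ln E\big[Z_N^m(\beta\sqrt{b})\big] \tag{22}$$

Then we consider the second term in Eq. (9). Using again the invariance property Eq. (13)
with $\exp[f_\alpha] = \exp[-\beta K(\alpha)]$ and computing the average we obtain

$$\frac{1}{N} E\Big[\ln \sum_{\alpha} \exp\Big[\frac{y(\alpha)}{m}\Big] \exp[-\beta K(\alpha)]\Big]
= \frac{1}{N} E\Big[\ln \sum_{\alpha} \exp\Big[\frac{y(\alpha)}{m}\Big]\Big] + \frac{\beta^2}{4}\,(b-1)\,m \tag{26}$$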
Putting together Eq. (22) and (26) we obtain:

$$P_N(\beta) \le \frac{1}{mN} \ln E\big[Z_N^m(\beta\sqrt{b})\big] - \frac{\beta^2}{4}\,(b-1)\,m \tag{27}$$

Now we use the simple fact that

$$Z_N^m(\beta\sqrt{b}) \le Z_N(m\beta\sqrt{b}). \tag{28}$$

This is a general fact in statistical mechanics: since the entropy is nonnegative by definition,
the free energy is increasing in $\beta$. Indeed, considering $f_N(\beta) = -\frac{1}{N\beta}\ln(Z_N(\beta))$, the
random free energy, one immediately checks that $f_N'(\beta) = \frac{1}{\beta}\,(u_N(\beta) - f_N(\beta)) = \frac{1}{\beta^2}\, s_N(\beta) \ge 0$,
where $u_N(\beta)$ is the random internal energy, and $s_N(\beta)$ is the random entropy. Therefore,
for any $0 < m < 1$, we have:

$$-\frac{1}{Nm\beta} \ln(Z_N(m\beta)) \le -\frac{1}{N\beta} \ln(Z_N(\beta)) , \tag{29}$$

which is equivalent to (28) when we replace $\beta$ by $\beta\sqrt{b}$. By inserting Eq. (28) into Eq. (27)
it is now easy to compute the expectation $E[Z_N(m\beta\sqrt{b})] = 2^N \exp[N m^2\beta^2 b/4]$ and we arrive at the upper
bound

$$P_N(\beta) \le \frac{1}{4}\, m\beta^2 + \frac{1}{m}\ln 2 . \tag{30}$$

Note that there is no longer any dependence on $b$ in the bound. Finally, the optimal bound
is obtained by minimization in $m$, which yields (16) and completes the proof. □
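Inequality (28) holds for every realization of the energies, since $t \mapsto t^m$ is subadditive for $0 < m < 1$. The following small sketch checks it on one random sample; the helper names and the choice of energies are illustrative assumptions.

```python
import math
import random


def partition_function(energies, beta):
    """Z(beta) = sum over levels of exp(-beta * E)."""
    return sum(math.exp(-beta * e) for e in energies)


def power_bound_sides(energies, beta, m):
    """Return (Z(beta)**m, Z(m*beta)); inequality (28) says first <= second for 0 < m < 1."""
    return partition_function(energies, beta) ** m, partition_function(energies, m * beta)


if __name__ == "__main__":
    rng = random.Random(7)
    energies = [2.0 * rng.gauss(0.0, 1.0) for _ in range(256)]  # 2^8 levels, H = sqrt(N/2) X with N = 8
    for m in (0.1, 0.5, 0.9):
        lhs, rhs = power_bound_sides(energies, 1.0, m)
        print(m, lhs, rhs, lhs <= rhs)
```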

Remark 3 It is interesting to note that for the REM (and for the REM only) the main
result of quadratic interpolation can be viewed as an implementation of Jensen's inequality:

$$E[\ln(Z_N(\beta))] = \frac{1}{m}\, E[\ln Z_N^m(\beta)] \le \frac{1}{m} \ln E[Z_N^m(\beta)]. \tag{31}$$

This holds for general $m > 0$, but one needs $m < 1$ to apply (28). If one uses the
"sum-rule" then one can get an explicit form for the error coming from Jensen's inequality in
this case.

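Remark 4 The inequality (16) takes two different forms depending on whether $\beta$ is
greater than or less than $\beta_c := 2\sqrt{\ln 2}$. For $\beta > \beta_c$, the right-hand side of (16) is
optimized at $m = \beta_c/\beta$. For $\beta < \beta_c$ the infimum over $0 < m < 1$ is attained in the limit
$m \to 1$. Therefore, one has

$$P_N(\beta) \le Q(\beta) , \tag{32}$$

where

$$Q(\beta) = \begin{cases} \frac{1}{4}\beta^2 + \ln 2 & \text{for } \beta \le \beta_c \\ \beta\sqrt{\ln 2} & \text{for } \beta > \beta_c \end{cases}
\;=\; \begin{cases} \frac{1}{4}\,(\beta^2 + \beta_c^2) & \text{for } \beta \le \beta_c \\ \frac{1}{2}\,\beta\beta_c & \text{for } \beta > \beta_c \end{cases} \tag{33}$$

Note that the two pieces match at $\beta = \beta_c$.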
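The passage from the variational bound (16) to the closed form (33) can be checked numerically. The sketch below minimizes the bracket in (16) over a grid of $m \in (0,1]$ and compares with (33); the grid resolution and the function names are illustrative assumptions.

```python
import math

LN2 = math.log(2.0)
BETA_C = 2.0 * math.sqrt(LN2)  # critical inverse temperature of the REM


def bound(m, beta):
    """Bracket of Eq. (16): m * beta^2 / 4 + ln(2) / m."""
    return 0.25 * m * beta ** 2 + LN2 / m


def q_closed_form(beta):
    """Piecewise formula (33)."""
    if beta <= BETA_C:
        return 0.25 * beta ** 2 + LN2
    return beta * math.sqrt(LN2)


def q_by_grid(beta, steps=20000):
    """Minimize bound(m, beta) over the grid m = k / steps, k = 1..steps (m = 1 included)."""
    return min(bound(k / steps, beta) for k in range(1, steps + 1))


if __name__ == "__main__":
    for beta in (0.5, 1.0, BETA_C, 2.0, 3.0, 5.0):
        print(beta, q_by_grid(beta), q_closed_form(beta))
```

For $\beta \le \beta_c$ the grid minimum sits at the endpoint $m = 1$, in line with the limit $m \to 1$ described in Remark 4.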



2.2 Lower bound 

The main new result in this paper is the adaptation of the quadratic interpolation method
to obtain an asymptotically sharp upper bound on $P_N(\beta)$, as we just considered above.
One should view the result of the previous section as an analogue, for the REM, of Guerra's
bounds for the SK model in [18]. On the other hand, for the REM, unlike for the SK
model, there are easy proofs of the same lower bound in the $N \to \infty$ limit. The exact
formula for the pressure of the random energy model is well-known. Derrida calculated
it when he introduced the model in [?], while a mathematically rigorous version of his
argument is included in [?, ?]. There are also proofs which rely more on large-deviation
theory, such as [?, ?]². We also encourage the reader to see the more recent, deep
analysis of [?]. (There is much interest in the REM as far as the statistics of energy
levels is concerned because in the bulk there is a kind of universality. See, for instance, [?].)

We will also present a proof of the lower bound. This is included primarily for
completeness, for nonexperts. However, let us digress briefly to justify this for the experts:
there is a great desire to obtain a purely variational proof of the lower bound for the
SK model. In light of that, it seemed worthwhile to explore how 'variational' the proof of
the lower bound really is, for the REM. (Of course, it is not as variational as one would
like.)

Since $P_N(\beta) \le Q(\beta)$ for all $N$, it follows that $P(\beta) \le Q(\beta)$ in the limit $N \to \infty$. We
want to show the opposite inequality is also true, to establish that $P(\beta) = Q(\beta)$ for all $\beta \ge 0$. The
key to obtaining the lower bound is to understand the high-temperature region, $\beta < \beta_c$.

² In [?] there is also a conjecture for the more general quantity $N^{-1}\ln E[Z_N(\beta)^a]$ for $a \in \mathbb{R}$. This is
substantiated in the preliminary section of [31].

Proposition 2.3

$$P(\beta) = Q(\beta) \quad \text{for } \beta < \beta_c . \tag{34}$$

The proof of this result is provided in the next subsection, while here we stress its
consequences. Stated otherwise, the upper bound of Proposition 2.2 saturates in the
$N \to \infty$ limit, at least when $\beta < \beta_c$. It is a remarkable fact that this high-temperature
result gives the sharpness of the upper bound also in the low-temperature region as follows.

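Corollary 2.4 For $\beta \in [0,+\infty)$

$$P(\beta) = Q(\beta). \tag{35}$$

Proof: It is a basic fact, easily seen from the definition (5), that $P_N(\beta)$ is convex in $\beta$
for each $N$. Therefore, the limiting function $P(\beta)$ is also convex. Hence, for any $\beta_0$ and
any $\beta > \beta_0$, we have

$$P(\beta) \ge P(\beta_0) + (\beta - \beta_0)\, DP(\beta_0) , \tag{36}$$

where $D$ is any convex combination of the left-handed and right-handed derivatives, which
we denote $D_-$ and $D_+$, respectively. We now take $\beta_0 \uparrow \beta_c$. Since we know from Proposition
2.3 that $P(\beta) = Q(\beta)$ for $\beta < \beta_c$, we easily calculate $\lim_{\beta_0\uparrow\beta_c} P(\beta_0) = 2\ln(2)$ while
$\lim_{\beta_0\uparrow\beta_c} D_-P(\beta_0) = \lim_{\beta_0\uparrow\beta_c} Q'(\beta_0) = \sqrt{\ln(2)}$. Putting this together gives
$P(\beta) \ge 2\ln(2) + (\beta - \beta_c)\sqrt{\ln(2)} = \beta\sqrt{\ln(2)} = Q(\beta)$ for $\beta > \beta_c$, which combined with
the upper bound (32) completes the proof.
□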
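The convexity argument amounts to the observation that the tangent line to $Q$ at $\beta_c$ coincides with the low-temperature branch of (33). A short numerical check of that identity, with illustrative function names:

```python
import math

LN2 = math.log(2.0)
BETA_C = 2.0 * math.sqrt(LN2)


def tangent_at_critical(beta):
    """Tangent line to Q at beta_c, built from Q(beta_c) = 2 ln 2 and Q'(beta_c) = sqrt(ln 2)."""
    return 2.0 * LN2 + (beta - BETA_C) * math.sqrt(LN2)


def q_low_temperature(beta):
    """Low-temperature branch of (33): beta * sqrt(ln 2)."""
    return beta * math.sqrt(LN2)


if __name__ == "__main__":
    for beta in (BETA_C, 2.0, 4.0, 10.0):
        print(beta, tangent_at_critical(beta), q_low_temperature(beta))
```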

2.3 High temperature region 

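The proof of Proposition 2.3 will be obtained through a sequence of lemmata. The crux
of the argument is standard. For example, see [29], Proposition 1.1.5. (There is another
approach in [5], Theorem 9.1.2, called the "truncated second moment method".) We start
with the following result, which is another variational calculation.

Lemma 2.5 Let $\Omega_{\beta,N}$ refer to the (expectation associated to the) random probability
measure on $\Sigma_N \times \Sigma_N$ specified by

$$\Omega_{\beta,N}\{f(\sigma,\sigma')\} = \sum_{\sigma,\sigma'} \frac{e^{-\beta[H_N(\sigma)+H_N(\sigma')]}}{Z_N^2(\beta)}\; f(\sigma,\sigma') .$$

For $0 < \beta < \beta_c$ we have

$$\frac{1}{N} \ln E\big[Z_N(\beta)\,\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}\big] \le \frac{\beta\beta_c}{2} \tag{37}$$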

Proof: Note that

$$\Omega_{\beta,N}\{\delta(\sigma,\sigma')\} = \frac{Z_N(2\beta)}{Z_N^2(\beta)} \tag{38}$$

By (28), we know that for $0 < m < 1$

$$Z_N(2\beta) \le Z_N^{1/m}(2m\beta) \tag{39}$$

Using Hölder's inequality with $p = 1/m$, $q = 1/(1-m)$ and $\frac{1}{p} + \frac{1}{q} = 1$,

$$Z_N(2m\beta) \le Z_N^{m}(\beta)\; Z_N^{1-m}\Big(\frac{m}{1-m}\,\beta\Big) .$$

Therefore,

$$Z_N(\beta)\,\Omega_{\beta,N}\{\delta(\sigma,\sigma')\} = \frac{Z_N(2\beta)}{Z_N(\beta)} \le \frac{Z_N^{1/m}(2m\beta)}{Z_N(\beta)} \le Z_N^{(1-m)/m}\Big(\frac{m}{1-m}\,\beta\Big) . \tag{40}$$

Since $m/(1-m)$ can take any positive value as $m$ ranges over $(0,1)$ this means

$$Z_N(\beta)\,\Omega_{\beta,N}\{\delta(\sigma,\sigma')\} \le Z_N^{1/r}(r\beta) \tag{41}$$

for every $r \in (0,+\infty)$. Moreover, for $r \ge 1$ we can use Jensen's inequality to obtain

$$E\big[Z_N^{1/r}(r\beta)\big] \le \big(E[Z_N(r\beta)]\big)^{1/r} = \exp\Big[N\Big(\frac{\beta_c^2}{4r} + \frac{r\beta^2}{4}\Big)\Big] . \tag{42}$$

It is easy to see that the optimal value is $r = \beta_c/\beta$, which does satisfy the constraint $r \ge 1$
because of the hypothesis $\beta < \beta_c$. Choosing this $r$ and putting (41) and (42) together
yields (37). □

A second useful estimate is the following concentration of measure property.

Lemma 2.6 For any $\beta$, and any $t > 0$,

$$P\{|N^{-1}\ln Z_N(\beta) - P_N(\beta)| > \beta t\} \le 2e^{-Nt^2/2} .$$

The analogous result for the SK model is Corollary 2.2.5 of [29]. For completeness we
will give a proof of Lemma 2.6 in Appendix A and, in particular, show that a straightforward
generalization of Talagrand's proof applies equally well to all models that satisfy
thermodynamic stability [?, ?].

The proof of Proposition 2.3 will essentially follow from the next result, which we
prove first.

Lemma 2.7 For any $0 \le \beta < \beta_c$,

$$\limsup_{N\to\infty}\; \sup_{0\le\beta'\le\beta}\; \frac{1}{N} \ln E\big[\Omega_{\beta',N}\{\delta(\sigma,\sigma')\}\big] < 0. \tag{43}$$

Proof: The proof will be obtained by induction. Let us define the sequence of temperatures
given by $\beta_0 = 0$, $\beta_{n+1} = g(\beta_n)$ for $n \in \mathbb{N}$, where $g$ is a definite function. As we will see,
we can choose

$$g(\beta) = \beta + a\beta_c\,[1 - (\beta/\beta_c)]^2 \quad \text{for any } 0 < a < 1/2, \tag{44}$$

and it will follow that $\beta_n \uparrow \beta_c$ as $n \to \infty$.

We first note that (43) is true for $\beta = \beta_0 = 0$, because one has $E[\Omega_{0,N}\{\delta(\sigma,\sigma')\}] = 2^{-N}$.
For the induction step, we will prove that if (43) is true for $\beta \in [0,\beta_n]$, then it is
also true for every $\beta \in [0,\beta_{n+1}]$. Then, since $\beta_n \uparrow \beta_c$, the statement of Lemma 2.7 follows.

To prove the induction step, we first observe that

$$\frac{\partial}{\partial\beta} P_N(\beta) = \frac{\beta}{2}\,\big(1 - E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}]\big) \ge 0. \tag{45}$$

Indeed, this is a simple calculation using the generalized Wick's rule. Since $Z_N(0) = 2^N$
(deterministically) we have $P_N(0) = \ln(2)$. Then the sum rule

$$P_N(\beta) = \ln(2) + \frac{\beta^2}{4} - \frac{1}{2}\int_0^{\beta} \beta'\, E[\Omega_{\beta',N}\{\delta(\sigma,\sigma')\}]\; d\beta' \tag{46}$$

follows.

Suppose now that (43) is true for $\beta \in [0,\beta_n]$. Consider a generic $\beta > \beta_n$, not necessarily
smaller than $\beta_{n+1}$, and for $t > 0$ let $A_N(\beta,t)$ be the event in Lemma 2.6:

$$A_N(\beta,t) = \{|N^{-1}\ln Z_N(\beta) - P_N(\beta)| > \beta t\}.$$

On $A_N(\beta,t)^c$ we have $Z_N(\beta) \ge e^{N(P_N(\beta)-\beta t)}$. Let us employ the following shorthand: given
a set $A$, let $I_A$ denote the indicator of $A$ and let us denote $E[I_A X]$ by $E[X, A]$ for each
and every random variable $X$. Then we conclude that

$$E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)^c] \le E\big[e^{-N(P_N(\beta)-\beta t)}\, Z_N(\beta)\,\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)^c\big]
\le e^{-N(P_N(\beta)-\beta t)}\, E[Z_N(\beta)\,\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}].$$

Therefore

$$\frac{1}{N}\ln E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)^c] \le -P_N(\beta) + \frac{\beta\beta_c}{2} + \beta t$$

follows from Lemma 2.5.

Since we assumed $\beta > \beta_n$, a lower bound for $P_N(\beta)$ is given by $P_N(\beta_n)$ and a lower
bound for $P_N(\beta_n)$ is given by $\frac{1}{4}[\beta_c^2 + \beta_n^2] - o(1)$, using Eq. (46) and the induction hypothesis,
where $o(1)$ represents a quantity whose limit is 0 when $N \to \infty$. One thing which is
important is that while $o(1)$ does depend on $\beta_n$, it is independent of $\beta > \beta_n$. Therefore,

$$\frac{1}{N}\ln E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)^c] \le -\frac{1}{4}(\beta_c - \beta_n)^2 + \frac{1}{2}\beta_c[\beta - \beta_n] + \beta t + o(1). \tag{47}$$

On the other hand, one always has $0 \le \Omega_{\beta,N}\{\delta(\sigma,\sigma')\} \le 1$. Hence,

$$E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)] \le E[1, A_N(\beta,t)] = P(A_N(\beta,t)).$$

So, by Lemma 2.6,

$$\frac{1}{N}\ln E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}, A_N(\beta,t)] \le -\frac{t^2}{2} + N^{-1}\ln(2). \tag{48}$$

Putting equations (47) and (48) together we obtain

$$\frac{1}{N}\ln E[\Omega_{\beta,N}\{\delta(\sigma,\sigma')\}]
\le N^{-1}\ln(2) + \max\Big\{ -\frac{t^2}{2} + N^{-1}\ln(2)\,,\; -\frac{1}{4}(\beta_c - \beta_n)^2 + \frac{1}{2}\beta_c[\beta - \beta_n] + \beta t + o(1)\Big\} \tag{49}$$

If we now take

$$\beta < \beta_n + \frac{1}{2}\beta_c\,[1 - (\beta_n/\beta_c)]^2$$

then it is clear that by choosing $t$ positive, but small enough, we will have a strictly
negative limsup of the left-hand side of (49) as $N \to \infty$. Choosing any $0 < a < 1/2$, let
us take $g(\beta) = \beta + a\beta_c[1 - (\beta/\beta_c)]^2$. Then for $\beta_{n+1} = g(\beta_n)$, we have proved the induction
step: for $\beta$ in the range $[0,\beta_{n+1}]$, inequality (43) also holds. □
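The temperature sequence of Eq. (44) can be iterated directly. Writing $d_n = 1 - \beta_n/\beta_c$, the recursion reads $d_{n+1} = d_n(1 - a\,d_n)$, so the sequence increases, stays below $\beta_c$, and satisfies $d_n \le 1/(1 + an)$. The sketch below, with illustrative function names, shows this slow convergence.

```python
import math

BETA_C = 2.0 * math.sqrt(math.log(2.0))


def g(beta, a):
    """Map of Eq. (44): g(beta) = beta + a * beta_c * (1 - beta / beta_c)^2."""
    return beta + a * BETA_C * (1.0 - beta / BETA_C) ** 2


def temperature_sequence(a, n_steps):
    """Return [beta_0, ..., beta_{n_steps}] with beta_0 = 0 and beta_{n+1} = g(beta_n)."""
    betas = [0.0]
    for _ in range(n_steps):
        betas.append(g(betas[-1], a))
    return betas


if __name__ == "__main__":
    seq = temperature_sequence(0.4, 10000)
    for n in (1, 10, 100, 1000, 10000):
        print(n, seq[n], BETA_C - seq[n])
```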

We complete this section with the proof of Proposition 2.3.

Proof: By Lemma 2.7 the integrand of the third term in the right-hand side of Eq. (46)
approaches 0 uniformly as $N \to \infty$, as long as $0 \le \beta < \beta_c$. This gives the desired result.
□

3 The Generalized Random Energy Model 

In this section we extend the method developed in the previous section to treat the GREM. 
This is essentially a "correlated random energy model" on a hierarchical graph - that is, 
a tree. 

3.1 Set-up and Basics 

The GREM is a family of models, taking various parameters for the definition. Let $n \in \mathbb{N}_+$
be an integer, equal to the number of levels in the hierarchical tree. Let $K_1, \dots, K_n$ be
positive integers such that $K_1 + K_2 + \dots + K_n = N$, where $N$ is the system size. Also, let
$a_1, \dots, a_n$ be real numbers such that $0 < a_i$ for $i = 1, \dots, n$ and $a_1 + a_2 + \dots + a_n = 1$.

Definition 3.1 Given $N \in \mathbb{N}$ and $\sigma \in \Sigma_N$, for $i = 1, \dots, n$, let $\pi_i(\sigma)$ be the canonical
projection over the subset $\Sigma_{K_i}$ generated by the lexicographical partition $\mathcal{P}$ of the
coordinates $(\sigma_1, \dots, \sigma_N)$ into the first $K_1$ coordinates, the successive $K_2$ coordinates and so
on up to the last $K_n$ coordinates. Namely, $\Sigma_N = \Sigma_{K_1} \times \dots \times \Sigma_{K_n}$ and

$$\pi_i(\sigma) = (\sigma_{K_1+\dots+K_{i-1}+1}, \dots, \sigma_{K_1+\dots+K_i}) .$$

Then the GREM Hamiltonian is a family of Gaussian random variables having the
covariance

$$E[H_N(\sigma)H_N(\sigma')] = \frac{N}{2} \sum_{i=1}^{n} a_i \prod_{j=1}^{i} \delta(\pi_j(\sigma), \pi_j(\sigma')) \tag{50}$$

An explicit form of the GREM Hamiltonian is

$$H_N(\sigma) = \sqrt{\frac{N}{2}} \sum_{i=1}^{n} \sqrt{a_i}\; X(\pi_1(\sigma), \dots, \pi_i(\sigma)) \tag{51}$$

where, for each $i = 1, \dots, n$, the family of random variables $\{X(\pi_1(\sigma), \dots, \pi_i(\sigma))\}_{\sigma\in\Sigma_N}$
are $2^{K_1+K_2+\dots+K_i}$ i.i.d. Gaussians, and each family is independent of the others.

Remark 5 The Hamiltonian (51) corresponds to a tree with branching number that at
each level is a power of two. We will stick to this case to simplify the notation, while the
more general case of arbitrary branching number (with the constraint of having approximately
$2^N$ leaves in the last layer) is completely equivalent in the thermodynamic limit.
In order to make statements that apply in the limit, we will consider sequences of $N$'s and
$K_1, \dots, K_n$ such that there are rational numbers $\kappa_1, \dots, \kappa_n$, all nonnegative, and summing
to 1, with $K_i = N\kappa_i$ for each $i = 1, \dots, n$.

We now prove the variational expression for the pressure of the GREM. The strategy is
to apply the results obtained for the REM in the previous section at each level in
the hierarchy. In order to denote the dependence on the parameters $a = (a_1, \dots, a_n)$ and
$\kappa = (\kappa_1, \dots, \kappa_n)$, let us write the GREM pressure as

$$P_N^{(n)}(\beta; a, \kappa) = \frac{1}{N}\, E[\ln Z_N(\beta; a, \kappa)] \tag{52}$$

and its thermodynamic limit as $P^{(n)}(\beta; a, \kappa) := \lim_{N\to\infty} P_N^{(n)}(\beta; a, \kappa)$.

3.2 Upper bound

Proposition 3.2 Consider the GREM, for which $C_H(\sigma,\sigma')$ is given by (50). For
an index $\alpha = (\alpha_1, \alpha_2, \dots, \alpha_n) \in \mathcal{A}^n$ let the random weights $w(\alpha)$ be given by

$$w(\alpha_1, \dots, \alpha_n) = \exp\Big[\frac{y(\alpha_1)}{m_1}\Big] \exp\Big[\frac{y(\alpha_1,\alpha_2)}{m_2}\Big] \cdots \exp\Big[\frac{y(\alpha_1, \dots, \alpha_n)}{m_n}\Big] \tag{53}$$

where the Poisson point process is now a cascade with intensity measure $e^{-y}\,dy$. Namely,
$y(\alpha_1)$ is the usual PPP, then for each given $\alpha_1$, $y(\alpha_1,\alpha_2)$ is an independent copy (labelled
by $\alpha_1$) of the PPP, and so on up to $y(\alpha_1, \dots, \alpha_n)$ which, for each given $\alpha_1, \dots, \alpha_{n-1}$,
is an independent copy of the PPP (labelled by $\alpha_1, \dots, \alpha_{n-1}$). We also choose a sequence
$0 < m_1 < m_2 < \dots < m_n < 1$ and

$$C_K(\alpha,\alpha') = (b-1)\,\frac{N}{2} \sum_{i=1}^{n} a_i \prod_{j=1}^{i} \delta(\alpha_j, \alpha'_j) \tag{54}$$

$$C_V(\sigma,\alpha;\sigma',\alpha') = b\,\frac{N}{2} \sum_{i=1}^{n} a_i \prod_{j=1}^{i} \delta(\pi_j(\sigma), \pi_j(\sigma'))\,\delta(\alpha_j, \alpha'_j), \tag{55}$$

where $b$ is a real number such that $b > 1$. Then we obtain the optimal upper bound:

$$P_N^{(n)}(\beta; a, \kappa) \le \inf_{0<m_1<\dots<m_n<1}\; \sum_{i=1}^{n} \Big[\frac{\kappa_i}{m_i}\ln(2) + \frac{\beta^2}{4}\, m_i a_i\Big] \tag{56}$$



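Proof: It will be along the lines of the proof of Proposition 2.2. Lemma 2.1 is applicable
because for each $i = 1, \dots, n$ one has

$$b \prod_{j=1}^{i} \delta(\pi_j(\sigma), \pi_j(\sigma'))\,\delta(\alpha_j, \alpha'_j) \le \prod_{j=1}^{i} \delta(\pi_j(\sigma), \pi_j(\sigma')) + (b-1) \prod_{j=1}^{i} \delta(\alpha_j, \alpha'_j) \tag{57}$$

For the first term of Eq. (9), using Eq. (55), we have

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha) \exp[-\beta V(\sigma,\alpha)]\Big]
= \frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} \exp\Big[\frac{y(\alpha_1)}{m_1}\Big] \cdots \exp\Big[\frac{y(\alpha_1,\dots,\alpha_n)}{m_n}\Big]
\exp\Big[-\beta\sqrt{\frac{bN}{2}} \sum_{i=1}^{n} \sqrt{a_i}\; X(\pi_1(\sigma), \dots, \pi_i(\sigma), \alpha_1, \dots, \alpha_i)\Big]\Big] \tag{58}$$

Since the sum over configurations $\sigma \in \Sigma_N$, $\alpha \in \mathcal{A}^n$ can be decomposed into $n$ sums over
each subset $\pi_i(\sigma) \in \Sigma_{K_i}$, $\alpha_i \in \mathcal{A}$ for $i = 1, \dots, n$, the invariance property (13) can now
be applied telescopically, starting at the $n$th level and tracing back up to the first level.
After this simplification we obtain

$$\frac{1}{N} E\Big[\ln \sum_{\sigma,\alpha} w(\alpha) \exp[-\beta V(\sigma,\alpha)]\Big]$$
$$= \frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha)\Big] + \sum_{i=1}^{n} \frac{1}{m_i N} \ln E\Big[Z_{K_i}^{m_i}\Big(\beta\sqrt{b\,a_i/\kappa_i}\Big)\Big]$$
$$\le \frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha)\Big] + \sum_{i=1}^{n} \frac{1}{m_i N} \ln E\Big[Z_{K_i}\Big(m_i\beta\sqrt{b\,a_i/\kappa_i}\Big)\Big] \tag{59}$$

where in the third line we have made use again of Eq. (28). For the second term of Eq. (9),
using Eq. (53) and the invariance property (13) we have

$$\frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha) \exp[-\beta K(\alpha)]\Big] = \frac{1}{N} E\Big[\ln \sum_{\alpha} w(\alpha)\Big] + \frac{\beta^2}{4}\,(b-1) \sum_{i=1}^{n} a_i m_i \tag{60}$$

Putting together Eq. (59) and (60) we arrive at the upper bound stated in the Proposition.

□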



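Remark 6 In the following we make the assumption

$$\frac{\kappa_1}{a_1} < \frac{\kappa_2}{a_2} < \dots < \frac{\kappa_n}{a_n} \tag{61}$$

in order to have a totally nondegenerate sequence of transition temperatures. To express
the inequality (56) in a more transparent form it is convenient to introduce a sequence
of critical temperatures: for $i = 1, \dots, n$ let $\beta_i^* = \beta_c\sqrt{\kappa_i/a_i}$ (where $\beta_c = 2\sqrt{\ln(2)}$ as in the
REM). Under the condition (61) this implies $\beta_1^* < \dots < \beta_n^*$. Because of the constraint
$0 < m_1 < \dots < m_n < 1$ the optimal $m_i$ is

$$m_i = \min\{1, \beta_i^*/\beta\} \tag{62}$$

for $i = 1, \dots, n$, the value 1 being attained by taking $m_i \uparrow 1$ in the infimum of Eq. (56).
Therefore, one has

$$P_N^{(n)}(\beta; a, \kappa) \le Q^{(n)}(\beta; a, \kappa), \tag{63}$$

where

$$Q^{(n)}(\beta; a, \kappa) = \begin{cases}
\sum_{k=1}^{n} \frac{a_k}{4}\big(\beta^2 + (\beta_k^*)^2\big) & \text{for } \beta \le \beta_1^* \\
\sum_{k=1}^{i} \frac{a_k}{2}\,\beta\beta_k^* + \sum_{k=i+1}^{n} \frac{a_k}{4}\big(\beta^2 + (\beta_k^*)^2\big) & \text{for } \beta_i^* \le \beta \le \beta_{i+1}^* \\
\sum_{k=1}^{n} \frac{a_k}{2}\,\beta\beta_k^* & \text{for } \beta \ge \beta_n^*
\end{cases} \tag{64}$$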
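The piecewise formula (64) and its decomposition into rescaled REM pressures, Eq. (67), can be compared numerically. The sketch below assumes a three-level example with parameters chosen to satisfy (61); the parameter values and the function names are illustrative assumptions only.

```python
import math

LN2 = math.log(2.0)
BETA_C = 2.0 * math.sqrt(LN2)


def q_rem(beta):
    """REM formula (33)."""
    if beta <= BETA_C:
        return 0.25 * beta ** 2 + LN2
    return beta * math.sqrt(LN2)


def critical_betas(a, kappa):
    """beta_i^* = beta_c * sqrt(kappa_i / a_i) for each level i."""
    return [BETA_C * math.sqrt(k / ai) for ai, k in zip(a, kappa)]


def q_grem(beta, a, kappa):
    """Piecewise formula (64): level k contributes its frozen term once beta exceeds beta_k^*."""
    total = 0.0
    for ak, bstar in zip(a, critical_betas(a, kappa)):
        if beta <= bstar:
            total += 0.25 * ak * (beta ** 2 + bstar ** 2)
        else:
            total += 0.5 * ak * beta * bstar
    return total


def q_grem_from_rem(beta, a, kappa):
    """Decomposition (67): sum_i kappa_i * Q^(1)(sqrt(a_i / kappa_i) * beta)."""
    return sum(k * q_rem(math.sqrt(ai / k) * beta) for ai, k in zip(a, kappa))


if __name__ == "__main__":
    a = [0.5, 0.3, 0.2]      # variances per level, summing to 1
    kappa = [0.2, 0.3, 0.5]  # fractions of spins per level, summing to 1
    print("critical betas:", critical_betas(a, kappa))
    for beta in (0.5, 1.5, 2.0, 3.0):
        print(beta, q_grem(beta, a, kappa), q_grem_from_rem(beta, a, kappa))
```

With $n = 1$ and $a_1 = \kappa_1 = 1$ both functions reduce to the REM formula (33).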



3.3 Lower Bound 

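Let us denote the REM pressure (Eq. (5)) as $P_N^{(1)}(\beta)$, and its thermodynamic limit (Eq.
(6)) as $P^{(1)}(\beta)$. This is not really an abuse of notation because if $n = 1$ then the GREM
is the REM, and $a_1 = \kappa_1 = 1$. In the same way, we write $Q^{(1)}(\beta) = Q(\beta)$, where $Q(\beta)$ is
defined in Eq. (33).

Then the lower bound is the following:

Proposition 3.3 For all $\beta \ge 0$,

$$P^{(n)}(\beta; a, \kappa) \ge Q^{(n)}(\beta; a, \kappa). \tag{65}$$

Proof: The proof will follow if we show that

$$P_N^{(n)}(\beta; a, \kappa) \ge \sum_{i=1}^{n} \kappa_i\, P_{K_i}^{(1)}\Big(\sqrt{a_i/\kappa_i}\;\beta\Big) . \tag{66}$$

Indeed, taking the thermodynamic limit $N \to \infty$ on both sides and using

$$Q^{(n)}(\beta; a, \kappa) = \sum_{i=1}^{n} \kappa_i\, Q^{(1)}\Big(\sqrt{a_i/\kappa_i}\;\beta\Big) \quad \text{for all } \beta \ge 0, \tag{67}$$

we obtain the statement of the Proposition. To prove (66) we introduce the interpolating pressure

$$\frac{1}{N} E\Big[\ln \sum_{\sigma} e^{-\beta H(\sigma,t)}\Big] \tag{68}$$

with

$$H(\sigma,t) = \sqrt{t}\,\sqrt{\frac{N}{2}} \sum_{i=1}^{n} \sqrt{a_i}\; X(\pi_1(\sigma), \dots, \pi_i(\sigma)) + \sqrt{1-t}\,\sqrt{\frac{N}{2}} \sum_{i=1}^{n} \sqrt{a_i}\; Y(\pi_i(\sigma)) \tag{69}$$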

where the $X$'s and $Y$'s are families of i.i.d. Gaussian random variables, each independent
of the other. A straightforward differentiation of Eq. (68) combined with integration
by parts yields Eq. (66). □

A Proof of Lemma 2.6: Concentration of measure

In this Appendix we prove the "concentration of measure" inequality (Lemma 2.6). The
proof is a generalization³ of the proof for the SK model (Corollary 2.2.5 of [29]).

We show that the standard deviation inequality applies to a large class of Gaussian
spin-glass models, which includes both mean-field (SK, p-spin, REM, GREM) and
finite-dimensional models (Edwards-Anderson, Random Field). We recall the basic result for a
function of Gaussian variables:

Theorem A.1 (Talagrand) Consider a Lipschitz function $F$ on $\mathbb{R}^M$, of Lipschitz
constant $A$. If $J_1, \dots, J_M$ are independent, standard normal random variables, then for each
$t > 0$,

$$P\{|F(J) - E[F(J)]| > At\} \le 2e^{-t^2/4} \tag{70}$$

This is Theorem 2.2.4 of [29]. Talagrand proves this using the "smart path" method,
which is his adaptation of the quadratic interpolation argument. Thus, his proof in [29]
differs from his earlier proofs [?] and from the proofs of others [21]. This is convenient for those
studying spin glasses: a single technique unifies and simplifies various tools.
Another application of quadratic interpolation is continuity of the pressure with respect
to the covariance of a spin glass Hamiltonian: Corollary 3.3 of [?]. Also, a result which
uses the same idea, and surprisingly predates the applications in spin glasses, is Slepian's
lemma [?].

Let us consider the general Hamiltonian given by

$$H_\Lambda(\sigma) = \frac{1}{\sqrt{2}} \sum_{X\subseteq\Lambda} \Delta_X\, J_X\, \sigma_X \tag{71}$$

where $\Lambda \subset \mathbb{Z}^d$, $\sigma_X = \prod_{i\in X}\sigma_i$, $\Delta_X \ge 0$, and the $\{J_X\}_{X\subseteq\Lambda}$ are a family of i.i.d.
standard Gaussian random variables, $E[J_X] = 0$, $E[J_X J_Y] = \delta_{X,Y}$. In order to have a
bounded quenched pressure we assume the following thermodynamic stability condition
holds: there exists a constant $c < \infty$ such that

$$\sup_{\Lambda} \frac{1}{|\Lambda|} \sum_{X\subseteq\Lambda} \Delta_X^2 \le c \tag{72}$$

We remark that this condition immediately entails the existence of the thermodynamic limit
for short-range models [?] and it also implies the validity of the Ghirlanda-Guerra
identities both for short-range and mean-field models [?, ?]. To prove Lemma 2.6 we
need to show that the random pressure function

$$P_\Lambda(J) = \frac{1}{|\Lambda|} \ln \sum_{\sigma} e^{-\beta H_\Lambda(\sigma, J)} \tag{73}$$

is Lipschitz. For this, we note that

$$P_\Lambda(J) - P_\Lambda(J') = \int_0^1 \frac{d}{dt} P_\Lambda(tJ + (1-t)J')\; dt \tag{74}$$

On the other hand

$$\frac{d}{dt} P_\Lambda(tJ + (1-t)J') = -\frac{\beta}{\sqrt{2}\,|\Lambda|} \sum_{X\subseteq\Lambda} \Delta_X\, (J_X - J'_X)\, \langle\sigma_X\rangle_t \tag{75}$$

with

$$\langle\sigma_X\rangle_t = \frac{\sum_{\sigma} \sigma_X\, e^{-\beta H_\Lambda(\sigma,\, tJ + (1-t)J')}}{\sum_{\sigma} e^{-\beta H_\Lambda(\sigma,\, tJ + (1-t)J')}} \tag{76}$$

From Eq. (74), by using $|\langle\sigma_X\rangle_t| \le 1$, the Cauchy-Schwarz inequality and the thermodynamic
stability condition (72), it then follows that

$$|P_\Lambda(J) - P_\Lambda(J')| \le \beta\sqrt{\frac{c}{2|\Lambda|}}\; \|J - J'\| , \tag{77}$$

where $\|\cdot\|$ denotes the $\ell^2$-norm. Therefore, $P_\Lambda(J)$ is Lipschitz, with Lipschitz constant
$A = \beta\sqrt{c/(2|\Lambda|)}$. Applying Theorem A.1 gives

$$P\{|P_\Lambda(J) - E[P_\Lambda(J)]| > t\} \le 2\exp\Big(-\frac{t^2\,|\Lambda|}{2\beta^2 c}\Big) \tag{78}$$

This result applies equally well to every Hamiltonian of the form (71). The REM
is obtained from Eq. (71) with the choice $|\Lambda| = N$, $\Delta_X = \sqrt{N 2^{-N}}$ (this is indeed
Hamiltonian (3)). In this case the condition (72) gives $c = 1$ and the statement of Lemma
2.6 is proved.

³ In [29] the generalization of Corollary 2.2.5 to p-spin models is implicit: for example in the proof of
Theorem 6.1.2. But our generalization to all models satisfying "thermodynamic stability" is new.